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*~^ Abstract 

xn 
, ^, Consider the problem of a Multiple-Input Multiple-Output (MIMO) 

Multiple-Access Channel (MAC) at the limit of large number of users. 

^7^ Clearly, in practical scenarios, only a small subset of the users can be 

f— >) scheduled to utilize the channel simultaneously. Thus, a problem of 

OO user selection arises. However, since solutions which collect Channel 

j^ State Information (CSI) from all users and decide on the best subset 

to transmit in each slot do not scale when the number of users is large, 

distributed algorithms for user selection are advantageous. 

(^ In this paper, we suggest a distributed user selection algorithm, 

^^ which selects a group of users to transmit without coordinating be- 

t> tween all users and without all users sending CSI to the base station. 

k> This threshold-based algorithm is analyzed, and its expected capac- 

^ ity in the limit of large number of users is investigated. It is shown 

that for large number of users it achieves the same scaling laws as the 

optimal centralized scheme. Multi-stage distributed schemes are also 

considered. 



1 Introduction 

Wireless access networks are the typical last-mile networks connecting mul- 
tiple users to a high speed backbone. In these networks, a base station (BS) 
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serves a large group of users. Traditionally, either the time or the frequency 
were divided to ensure users do not interfere with each other. Modern cod- 
ing techniques, however, allow for multiple users to either transmit or receive 
simultaneously, and be decoded successfully using the appropriate Multiple 
Access Channel (MAC) codes or Broadcast Channel (BC) codes, respectively. 

Nevertheless, consider a BS serving a very large number of users. In prac- 
tical scenarios, not all users can be served simultaneously, and the problem 
of user selection or scheduling arises. In the downlink setting, where a BS 
transmits to a group of users, it is common to assume Channel State Infor- 
mation (CSI) is available at the BS, hence intelligent user selection can be 
employed. E.g., the BS can select a subset of the users with both strong 
channel norms (high SNR) and relatively orthogonal directions (to avoid in- 
terference). See Subsection |1.1| for a literature survey. Such a selection can 
exploit the multi-user diversity inherent in such channels. 

In the uplink setting, however, since the users transmit to the BS, it is 
highly desirable to avoid the process of collecting all information at the BS 
beforehand, and notifying the users which should transmit. This process is 
prohibitively complex when the number of users is very large. As a result, 
distributed algorithms for selecting the appropriate group of users are desir- 
able. This way, there is hope to harness the benefits of multi-user diversity 
without the need to collect CSI from all users. 

Main Contribution 

In this paper, we consider a Multiple Input Multiple Output (MIMO) MAC 
channel with r receiving antennas and K ^ r users. We suggest distributed 
algorithms for selecting a group of users to transmit in each slot. In the 
first, a threshold value for the norm of the channel vector is set, and only 
users above the threshold transmit. Hence, there is no need to collect CSI 
from all users. Nor is any cooperation required. In the second, an iterative 
process is suggested, where multiple thresholds are set on the norms of the 
projections of the channel vectors on the spaces of previously selected users. 
In this case, CSI is shared, but only among the selected group. An analysis 
of the resulting system capacity in the limit of large K and the respective 
scaling laws are given. The simple threshold based algorithm is shown to 
achieve the optimal scaling law. Thus, the benefit compared to traditional 
techniques is demonstrated both analytically and via simulation results. 



1.1 Related Work 

The essence of multi-user diversity was introduced in [1], where selecting the 
strongest user in each time slot was first suggested. The work was followed 
by numerous scheduling algorithms for various scenarios. We list here only 
the most relevant. 

In pj, the authors considered the impact of multi-user diversity on the 
MIMO downlink channel (BC). Assuming channel state information at the 
BS, the authors used order statistics to evaluate the effective SNR when 
scheduling the strongest user in each slot. However, only one user was sched- 
uled in each slot, and the results were given in terms of the K-io\d statistics, 
without an extreme value analysis for large K. 

In |3| , a similar downlink model was considered, however, when users are 
scheduled simultaneously. The authors considered Zero-Forcing Beamform- 
ing (ZFBF), and suggest a greedy algorithm to schedule the strongest and 
most orthogonal users. Additional scheduling algorithms for downlink com- 
munication were given in [H 13, El [7]. In fact, in the downlink scenario, it was 
showed later that ZFBF and optimal user selection can indeed achieve the 
Dirty Paper Coding (DPC) region [8], and is hence optimal in the Gaussian 
case [9]. Additional surveys and scaling laws can be found in [TOl [HI [12] . 

A closely related scheme, yet still for the downlink model, was suggested 
in [13]. Using Block Diagonalization (BD), a capacity-based greedy algo- 
rithm was suggested, in which first the strongest user is scheduled, and then 
additional users are added, one by one, based on their marginal contribution 
to the total capacity. In the same context, [14J considered the special case of 
two transmit antennas and one receive antenna per user, and showed that a 
greedy, two-stage algorithm, which first selects the strongest user and then 
the second to form the best pair is asymptotically optimal. In this paper, 
we suggest a single-stage algorithm for the uplink scenario, and, in addition, 
analyze its sum capacity in the limit of large number of users and give the 
resulting scaling laws. In the context of heterogeneous users, [T3] proposed 
a scheduling scheme which selects a small subset of the users with favorable 
channel characteristics. 

The above works focus on the downlink setting. In this scenario, it is 
reasonable to assume that at least some information is available at the BS, 
and a centralized decision can be made. In the uplink (MAC) model, however, 
if one wishes to select a group of users without centralized processing at the 
BS, distributed algorithms are required. 



In [ISl[I7j, a decentralized MAC protocol for Orthogonal Frequency Divi- 
sion Multiple Access (OFDMA) channels was suggested. In this scheme, each 
user estimates the channel gain and compares it to a threshold. Only above- 
the-threshold users can transmit. ^8] extended the scheme to a multi-channel 
setup, where each user competes on m channels. Recently, the authors pro- 
posed a Point Process approximation which facilitates the analysis of various 
distributed threshold-based scheduling algorithms in the non-homogeneous 
scenario p^ 120] . The work in [19] , however, assumes only a single user can 
be successfully decoded in each time slot. A key contribution in this pa- 
per is the non-trivial extension of the work in ^\ to truly multiple-access 
protocols, were several users transmit simultaneously and should be decoded 
successfully, and the question is how to distributively select a good subset 
of users to transmit. For example, a closely related work is [21]. Therein, 
various decoding procedures were discussed, and the corresponding best user 
selection for the uplink setting was given. However, while reinforcing the ne- 
cessity of proper user selection, the work in [21] considered only the scenario 
where one user can access the radio channel at a given time. 

As for more complex topologies, spatial diversity in the context of multiple 
relays was considered in [22j . Therein, communication between a source and 
a destination is done through a group of relays. However, unlike conventional 
relay schemes, only the relays with the strongest received signal decode the 
message and cooperate via space time coding to successfully relay it to the 
destination. An asymptotically optimal scheme for multiple base stations 
(with joint optimization) was given in p3]. 

Extreme Value Theory (EVT) is a key tool in proving capacity results 
under scheduling and multi-user diversity. In [21], the authors suggested 
a sub-carrier assignment algorithm, and used order statistics to derive an 
expression for the resulting link outage probability. In [25], the authors used 
EVT to derive the scaling laws for scheduling systems using beamforming and 
linear combining. [26] analyzed the scaling laws of base station scheduling, 
and showed that by scheduling the strongest among K stations one can gain 
a factor of 0{^/2\ogK) in the expected capacity (compared to random or 
Round- Robin scheduling) . 



2 Preliminaries 

2.1 System Model 

Throughout this paper, random variables are denoted in bold. We consider a 
multiple-access model with K users, each with a single transmit antenna. The 
BS is equipped with r receiving antennas. When k users utilize the channel 
simultaneously, the received signal at the base station can be describes as: 
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Xj + w, (1) 



i=l 



where Xj G C is the transmitted signal (scalar). Xj is constrained in its total 
power to P, i.e., ii^[xjxj] < P. However, in most cases, we will assume a 
constant power constraint P. w G C^ denotes the uncorrelated Gaussian 
noise, hj G C^'^^ is a complex random Gaussian channel vector. When all 
users are identically and independently distributed, it is common to assume 
the entries of hj have mean zero and variance 1/2 imaginary and real parts for 
all users. We assume that the channel is memoryless, that is, for each channel 
use (slot), independent realizations of {hj}^^ are drawn. Furthermore, we 
assume full CSI is available at the transmitter. That is, hj is known to the 
ith user. This can be accomplished by sending a pilot signal form each of 
the r antennas at the base station. 

2.2 Capacity and Multi-User Diversity Via EVT 

The capacity obtained by letting an arbitrary user i utilize the channel is 
given by C = log(l + P||hjp). However, as mentioned, it is beneficial to 
schedule the strongest user in each slot. Denote by h(i) the received channel 
vector with the largest norm. Scheduling the strongest user clearly results in 
C = log(l + P||h(i)|p), while letting a group of r orthogonal users, with the 
largest channel gains to utilize an uplink channel, results in a sum-rate that 
has the following upper bound: 

C<rlog(l + P||h(i)f). 

Remark. For the downlink channel, the sum-rate upper bound has the form p[5 

C<rlogfl + ^||h(i)f 



The difference between the uphnk to the downhnk originates in the power 
constraint apphed to the transmitter. That is, in the downstream, when 
transmitting to a group of receivers, each receiver gets a share of the available 
power, while in the upstream, there is a group of transmitters that transmit 
to a single receiver. It should be noted that usually the power constraint 
P for the downlink and uplink are not equal, since the base station has a 
strong and steady power supply, whereas the user has a limitted battery 
power supply. 

As the sum rate is mainly influenced by the channel vectors' gains and di- 
rections, our goal is to explore this behavior for large number of users. Specif- 
ically, we first wish to explore the behavior of the maximal gain. Since the 
entries of h are complex Gaussian, the channel's gain follows a x^-distribution 
(Chi-squared distribution) with 2r degrees of freedom, denoted xL- We uti- 
lize the following EVT theorem. 

Theorem 1 ( [271 [2H1 122] ) • Let xi,..x„ he a sequence of i.i.d. random vari- 
ables with distribution F{x), and let M„ = max(xi, ...,x„). // there exists a 
sequence of normalizing constants a„ > and hn such that as n ^ oo, 

Pr(M„ < a„x + hn) ^ G{x) 

for some non- degenerate distribution G, then G is of the generalized extreme 
value (GEV) distribution type 

G(x) = exp{-(l + ^a;)-i/«} 

and we say that F{x) is in the domain of attraction of G, where ^ is the 
shape parameter, determined by the ancestor distribution F{x). 

The normalizing constants and the shape parameter of the GEV can be 
obtained as follows. Let h{x) be the reciprocal hazard function 

, , , 1 - Fix) , p 

'^(^) = — TT^ — tor Xf "£ X < X , 

fix) 

where Xp = mi{x : F{x) > 0} and x^ = sup{x : F{x) < 1} are the lower 
and upper endpoints of the ancestor distribution, respectively. The shape 
parameter ^ is obtained as the following limit [271 [28] : 

liin J 

^ =x — )■ x^ -—h(x). 
ax 



When {x„} is a sequence of i.i.d. xlr variables, the asymptotic distribution 
of M„ is a Gumbel distribution [291 PP- 156]. Specifically, 



Pr(M„ < ttnX + br, 



where 



a^ 



= 2, (2) 

bn = 2(logn + (r-l)loglog?2-r(r)) + o(l). (3) 

However, for i.i.d. xL random variables, the convergence of the maxima 
to the Gumbel distribution using the above normalizing constants is quite 
slow. That is, the approximation of the maximal value will not be tight for 
moderate values of r, n e M. Hence, a more appropriate set of normalizing 
constants for the xir distribution, which takes into account both r and n 
should be derived. Letting bn to be the 1 — 1/n quantile, i.e., 1 — Fy/2(bn) = 
1/n, and choosing a„ = h{bn) [2Zl EH], we have the following. 

Claim 1. For the X2r"'^^'Sin6'uiion, the following normalizing constants apply. 
^M = ^r(r)exp|g-^(r,i)}g-^(r,i) ' (4) 

b{n,r} = 2Q'^ ir,-] +o{a{n,r}), (5) 
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where r(r) is the Gamma function and Q ^ {r, ■^) is the inverse of the regu- 
larized upper incomplete gamma function. 

Proof. The x^ distribution is a special case of the gamma distribution. I.e., if 
x ~ xlr then x ~ r(r, /3 = 2), where T{r, P = 2) is the Gamma distribution 
with shape parameter r and rate parameter /3. Accordingly, for the b[n,r} 
constant we consider the 1 — 1/n quantile of the Gamma distribution, which 
can be obtained by using the inverse of the regularized upper incomplete 
gamma function. In particular, 6{„,r} = f^Q'^ {r, }{} yields the 1 — 1/n quan- 
tile of the Gamma distribution. To attain the a{n,r} constant, let us examine 
the hazard function h{x) of the Gamma distribution. 
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Figure 1: Convergence of the normalizing constant a{n,r} for r = {2z}f^^ 
receiving antennas to the hmit a„ = 2. 



Accordingly, for x = 6{„,r} we obtain, 



a{n,r} = h {b{n,r}/(3) 

n 



rWexp{Q-(.,i)}o-'(..i) 



1-r 



n 



In Figure [T] we see the value of the normalizing constant a{n,r} for several 
values of receiving antennas r. This also depicts the rate of convergence of 
of a{n,r} to the a„ given in Q. 

In Figure [2] we see a tight convergence of the x^-distribution, for several 
values of receiving antennas r, to the limit Gumbel distribution when using 
the normalizing constants a{n,r}, &{n,r} given in Q and (tsl), respectively. 



2.3 Linear Decorrelation 

In this paper, we center our attention on linear decoding at the BS. Specifi- 
cally, focusing on the signal received from the jth user, rewrite ([I]) as: 

k 
y = hjXj + ^ hiXj + w. 
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Figure 2: Bars are simulation results, while the solid lines depicts the ana- 
lytical results of the max norm distribution with {4r}^^]^ degrees of freedom 
respectively, for K = 300 users. 



Let Yj be a unitary matrix representing the null space of the subspace 
spanned by {hjji^j. Since the entries of the channel vectors are i.i.d., when 
k users transmit the subspace spanned by the vectors {hjjj^j has rank k — 1 
with probability one [301 Chapter 8]. Thus, to decode, the receiver projects 



the received vector y on the subspace spanned by Vj, and nulls the inter- 
stream interference successfully (i.e.. Zero- Forcing receiver). Finally, the sig- 
nal of user j can be demodulated using a matched filter (i.e., maximal ratio 
combiner). The algorithm is thus as follows. 



ZERO-FORCING({hj}^, 
1 for j ^ 1 to k 


=i) 




2 
3 

4 


do H ^ [hi •■ • hj_ihL,+i 
Vj ^ nullspace{H) 


■■■hk] 



Note that a full degrees-of-freedom gain is attained when r users transmit. 
In this case, dim(Vj) = 1, and ||Vjhj|p ~ x^; ^.g., [30]. Accordingly, from 
this point on, we aim at algorithms which select at most r users (of the 
available K) in each time slot. As mentioned, since we focus on the scenario 
in which K ^ r, the set of selected users has a crucial affect on the system 
capacity. Optimally, a BS would receive CSI from all users, and schedule 
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the r best users for transmission. Under the hnear decorrelation above, the 
resulting expected capacity is 



max y^E\og(l + P\\Yihif) 

l,...K\m=r^-^ ^ ' 



XC{1,...X},|X| . 

However, we wish to avoid the overhead and complexity of such a centralized 
process, and select a group of users, approximating the optimal selection, 
distributively. 

3 A Distributed Algorithm 

A common approach to select a single user distributively, is a threshold-based 
procedure, in which a capacity threshold is set, and only user who exceeds 
it transmits {^M, lISj)- Of course, the events in which non of the users or 
several users exceed the threshold should be taken into account. In this 
paper, however, we wish to select a group of users, and analyze the resulting 
capacity. 

At the heart of the algorithms we suggest herein, stands a similar threshold- 
based procedure. However, the challenge is twofold. First, in selecting a 
threshold such that a favorable group of users exceed it. Second, in analyz- 
ing the results under the decorrelation procedure and at the limit of large K. 
We suggest two distributed algorithms. In the first, described in this section, 
a single threshold is utilized when decoding is done using linear decorrelation. 
In the second, described in Section |4| we offer a set of thresholds to match 
the Successive Interference Cancellation (SIC) procedure. 

Given the number of users K, we set a threshold Uk on the norm ||h|p, 
such that k <r strongest users exceed it on average. In each slot, each user 
estimates its channel's norm. A user with a norm greater than the threshold, 
transmits. We assume the transmission includes the channel vector as a low- 
rate preamble so the BS has the CSI of the transmitting users. The algorithm 
for user i is summarized as follows. 



Channel- AccEss(hj, Uk) 

1 if||hif >Mfc 

2 then transmit 

3 else keep silent 
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Proposition 1. For sufficiently large K, the expected system capacity of 
Algorithm Channel- Access with ZF decoding is given by 



ye 



-k 3 



EC{uk) = Y^ -^ Yl ^ [l°g (l + ^l|V^h,f ) I ||h,f > Uk\ + 0{1/K), 
j=i ^' i=i 

where k, to be optimized, is the expected number of users to exceed the thresh- 
old Uk, the {hj}^-[ are the channel vectors of the users who exceeded the 
threshold and {ViWzi ore the corresponding null spaces. 

Proof. According to the law of total probability, the expected capacity in 
a slot is determined by the number of users who exceed the threshold and 
the capacity each user sees, given that it exceeded the threshold. However, 
if more than r users are transmitting in a slot, then the receiver can not 
successfully null the inter-stream interference. Thus, in this case we say that 
a collision occurred, and the capacity in that slot is zero. Hence, the expected 
capacity do has the form: 

r j 

y^Pr{j users exceed} /^E [Ci\ ||hj|p > m^] . 
j=i i=i 

When the users are i.i.d., then the probability of j threshold exceedances 
follows the binomial distribution with probability p = k/K to exceed the 
threshold. Since we consider large K and small values of k, then the number 
of users to exceed threshold can be approximated by the Poisson distribution 
with an approximation error in the order of 1/K. D 

To ease notation, the approximation error is omitted from now on. 

Note that the receiver cannot recover more than r data streams. That is, 
if more than r streams are sent to the receiver simultaneously, then some re- 
dundant stream might demolish the other data streams. Hence, if more than 
r users begin transmission then we assume a collision occurs and the whole 
slot is lost. Similarly, since users act independently, a slot might be idle, if 
no user exceeded the threshold. Thus, we say that a slot is utilized if at least 
one user, but no more than r users, are transmitting. Note, however, that 
for each r, the effective SNR seen by the users who exceeded the threshold 
is different, as the dimension of Vj is r — j ' + 1. The complete proof is given 
in Section m 
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To evaluate the result in Proposition [T| the behavior of ||Vjhj|p should 
be understood, especially considering the fact that the number of users ex- 
ceeding the threshold is random. To this end, the following upper and lower 
bounds are useful. These bounds will be the basis for the scaling laws we 
derive. 

Lemma 1. The expected system capacity of Algorithm Channel- Access 
with ZF decoding satisfies the following upper bound. 

EC{uk) < Y^ — pj log (^1 + -(r - J + l){uk + a{K,r})) , 



where a{K,r} is given by Q and Uk is the threshold set such that k 
exceed it on average. 

Proof. By Jensen inequality. 



EC{uk) = ^^^E[log(l + P||V,h, 



users 



^ 1 i -k ^ 

~7! 



hill > Uk 
5^ log (l + PE [||V,hj2| ||h,f >uk\). (6) 



Consider the norm ||Vjhj|p, where V, has r — j ' + 1 columns. Denoting by 
V, the mth. column of V,- , we have 



£;[||v,h,f|||hJp>Mfc] 



E 



r-j+l 



E(^rU.) 



m=l 



\iiif > Uk 



(a) 

(J 

(J 

(d) 



r-j+l 



E^ 



m=l 



(rn) u \2 



Ih,; 



,2 (vr,h.) 



h!l 9 II "\ tV'^J II 9 
ill ll^i II 

r-j+l 



E[||h,,f|||h,f >,,,] J2 E 



m=l 



|hi|| > Uk 

111 119 ||-l 7-(m) 119 



E[\\h,f\\\h,f>Uk]{r-j + l) [ {l-ay-'da 

Jo 



{uk + aK){r-j + 1)-. 
r 
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In the above chain of equahties, (a) is since ||Vj p = 1. (b) is since hj is a 
random i.i.d. complex normal vector, and —^ — ' r^) is its angle from V , 

||hi|p||V^ II 

a vector in the null space of {h;};^j. Since these vectors are independent of 
hj, this angle is independent of the norm of hj. (c) is since the distributions 
of the norms and angles are independent of m, and since, by [HI Lemma 3.2], 
the angle has the same distribution as the minimum of r — 1 independent 
uniform [0, 1] random variables (i.e., with CDF 1 — (1 — a)''~^, < a < 1). 
(d) is the result of computing the expected norm of an i.i.d. complex normal 
random vector, given that it is above a threshold which is exceeded by only k 
out of K norms on average. The details are in Corollary [T| Section [5j 
Substituting in ([6]), we have 



j=i -^ 1=1 



ECiuk) < 22 —J- Yl ^°S (l + -{uk + ci{K,r}){r -3 + 1)^ 

< Yl -^^ ^°S (l + -(«fc + o/c) (r - J + 1)) , 



which completes the proof. D 

The bound in Lemma [1} while not giving the exact capacity, still depicts 
the essence of the system behavior. Set a threshold such that k out of the 
K users exceed it on average. This means the average exceedance rate is k, 
and indeed the expression ^^ — in the sum over j gives the probability for 
exactly j users exceeding. 

Remark. Note, however, that in practice it is beneficial to choose k slightly 
smaller than the number of antennas r. This is since if less than r users 
exceed, the SNR seen by each user is only larger, yet if more than r users 
exceed, the slot is lost. 

Each of the j users, under zero forcing, experiences a single user channel, 
with its power P scaled according to two factors: (i) a multiplication by 
{uk + ax), as this is the average norm of its channel vector, where Uk is the 
threshold exceeded, and ax is the average distance above the threshold, (ii) 
a multiplication by ''"^"'""'^ , as in case only j < r users exceeded the threshold, 
the zero forcing algorithm does not have to cancel r — 1 users, only j — 1, hence 
the null space has a larger dimension, yet the number of receive antennas is 
r. Indeed, as it turns out in the simulation results, the bound in Lemma [l] is 

13 



tight even for relatively small number of antennas and users, and, moreover, 
gives the exact scaling law. 

Remark. Note that if less than r users exceed, then a higher SNR can be 
attained at the receiver. However, in order to achieve the capacity in this 
case, a user must know how many users exceeded along, so it can exploit 
the high SNR for e.g., higher transmission rate. Hence, we require that the 
number of users that actually exceeded the threshold will be announced. 

Let us derive a corresponding lower bound. 

Lemma 2. The expected system capacity of Algorithm Channel- Access 
with ZF decoding satisfies the following lower bound. 



EC{uk) 



> 
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'1 



a] 



\r-2 



log (1 + Puka) da, 



where Uk is a threshold set such that k users exceed it on average. 
Proof. Following the derivations of the upper bound, we have 



EC(ut) 












E^ 



k i 



log 1 



r-j+l 



m=l 
r-j+1 



j^Eiog 1+Pizfc Y. 



Ilh I|2||V(™)|I2 
ll"«ll II ^ i 



> Uk 



(vr'^h,)^ 



1=1 



m=l 



|h,;||2||V 



(m) I 



(fe) ^^ k^e 



j! 



-jE\og{l 



puk J2 

m=l 



hll 9I|-V7"("*) 119 
i'll ll^i' 



——jE\og\l + Puk ,,, 

j=l -^ \ ll'-'i'll II ^ i' 



(c) ^^ k^e 



J- Jo 



1)(1 



a 



,r-2 



log (1 + Pu^a) da 



where (a) is since the norms of all users participating are above the threshold 
Uk\ (b) is since the angles in the inner sum are identically distributed and 
indepenent of i, hence an arbitrary 1 < i' < j can be used; (c) is by explicitly 
computing the expectation over the angle between hj and VJ , remembering 
that it has a density (r — 1)(1 — aY"'^ for < a < 1. This completes the 
proof. n 
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The results above lead to the following scaling law, which is the main re- 
sult in this section. It asserts that the scaling law of r log P log K for the sum 
rate in a multi-user system can in fact he achieved distributively, without col- 
lecting all channel states from all users and scheduling them in a centralized 
manner. In other words, the threshold based algorithm suggested selects an 
optimal set of users (asymptotically in the number of users) distributively 
and without any cooperation. 

Theorem 2. The expected system capacity of Algorithm Channel- Access 
with ZF decoding scales as G(r log(Plog K)) for large enough number of users 
K. 

Proof. By Lemma [T| 

2p J p ~ "^ / 

< 2^ — pj log (^1 + PK + ax) 

< r\og{l + P{uk + aK)). 



On the other hand, consider the lower bound given in Lemma [2j Since fc is a 
parameter to be optimized, the optimum is at least as large as when choosing 
k = r. We have 

where the last inequality is by evaluating the sum at r = 2. Note that larger 
values of r give only slightly larger values, with a limit of 0.5 as r — ;■ ooq 

Now, consider the integral over a in Lemma [2j This integral is a know 
hypergeometric function (which can be handled in Mathematica), which by 
an infinite sequence of integration by parts, can be expressed as an infinite 
series as follows. 

(r - 1) f\l - ay-' log (1 + Pua) da=—f] ''^~''^' 



,^ + r)l 



^This is the CDF of a Poisson random variable with parameter r, calculated at r — 1. 
The limiting behavior can be found in [31j . 
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Figure 3: Bars are simulation results, while the solid and dashed lines repre- 
sent upper and lower bound results respectively. Expected capacity under a 
single threshold algorithm and K = 300. The threshold Uk is set such that k 
users exceed it on average. Bars are simulation results, while the solid and 
dashed lines represent the upper and lower bounds, respectively. The green 
(upper), red (middle) and blue (lower) lobes are for r = 8,4 and 2 receive 
antennas, respectively. Note that the optimal k is smaller than r. 



The resulting expression can be symbolically evaluated for integer values of 
r (the number of antennas). E.g., for r = 4 we have ([T]) at the top of the 
next page. As a result, for large u, the integral is approximately logu. 

Since we consider the regime of large enough number of users K yet a 
finite number of antennas r, we have k = 0(1) and as a result Uk = 0(log K). 
In fact, since the distribution of the projected channel gain seen by a user in 
this decoding scheme is the exponential distribution with rate 1/2, it can be 
shown that Uk = 2(logi^ — log A;) (we discuss the threshold value in detail in 



Section 5.1). This gives rise to the 0(rlog(l + PlogK)) scaling law. 

In Figure [3] we simulate the expected capacity of the threshold-based 
scheduling scheme, with ZF-receiver and compare the simulation results to 
the upper and lower capacity bounds given in Lemma [T] and Lemma |2| re- 
spectively. 

n 
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r-1)/ {1 -ay-^ log (l + Puka) da 
Jo 

-6m- 15^2- llM^ + 61og(l + M) + 18Mlog(l + M) + l8u^\og{l + u) + Qu^\og{l + u) 

(7) 



4 A SIC-Based Algorithm 

In the previous section only a single threshold was used, and a user's data was 
decoded by projecting the received signal on the null space of the sub-space 
spanned by the channels of the interfering users. However, the receiver can 
use SIC [Wi, Ch.6], using the decoded signal of a previous user to decode the 
next one (by subtracting it from the received stream). After r iterations, all 
data streams are decoded. With this in mind, we derive a second, iterative 
algorithm, to further utilizing the benefits of SIC. At first glance, for large 
number of users, a set of thresholds {m*^*^} should be chosen, such that in 
each iteration only a single user exceeds on average. 



SIC-Channel-Access() 

1 H^{} 

2 ho ^h 

3 u^^) ^Q-^r~i + 1, j^) 

4 for ? ^ 1 to r 

5 do if ||hop > -u^*^ 

6 then transmit h*^*^ 

7 else receive hl-.H -i^ H[Jh}.. 

8 V ^ nullspace{H) 

9 ho = Vho 

However, even though the thresholds are chosen that way, it is more than 
likely that in some of the r iterations more than one user, or no user, will 
exceed the threshold. Specifically, the probability that exactly one user will 
exceed in an iteration is approximately 1/e. Yet, if a carrier sense, or collision 
detection mechanism is available, collisions can be resolved by allocating a 
few mini-slots devoted to finding the strongest user in each iteration p^ 
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This gives rise to the following algorithm: After the strongest user 
is found, it begins its transmission by announcing its channel vector. In 
the ith iteration, the rest of the users project their channel vector on the 
orthonormal basis V(j_i), which spans the null space of the channels vectors 
announced thus far. Now, the user for the ith iteration is the one with the 
strongest projection. The process ends when r users are selected. Note that 
this involves announcements of the channel vectors only from the r <^ K 
selected users. 

When a user projects its channel on V(j-i), the resulting gain distribution 
follows the x^-distribution with 2(r — i + 1) degrees of freedom. That is, 
||V(j_i)h(j)p ~ xlir-i+i) I3n]- Let h*^*) be the channel of the strongest user in 
iteration i. Utilizing the EVT for the X2(r-i+i)"'^i^t^ibution, the gain ||hW||2 
is characterized, and we have the following. 

Proposition 2. For sufficiently large K, the expected system capacity of 
Algorithm SIC-Channel- Access with ZF-SIC decoding is given by: 



a.({w«}Li) = J2e [log(l + P||V(,_i)h( 



.«l|2 
-^ l^^SK^ I - II " (i-l)- 

4 = 1 



where h*^*^ is the channel vector of the strongest user after the ith projection, 
and {V(j_i)}^^;^ are the corresponding null spaces. 

To evaluate the performance of Proposition 2 the distribution of || V(j_i)h'^*) ||^ 
should be examined. The following upper and lower bounds will be the basis 
for the scaling laws we derive. 

Lemma 3. The expected system capacity of Algorithm SIC-Channel- Access 
with ZF-SIC decoding satisfies the following upper bound. 

r 

Cav < ^ log(l + P{b{K,r^i+l} + ia{K,r-i+l})), 
i=l 

where a{K,r-i+i},b{K,r~i+i} o-^e the normalizing constant given in Oj and (^ 
respectively, and 7 ~ 0.57 is the Euler Gamma constant. 



Proof. By Jensen inequality, 

^a.({««}[=l) 

r 

= ^E[log(l + P||V(,_i)h«f^ 

r 

<^log(l + PEO|V(.-i)h«f]). 

Since the norm ||V(j_i)h*^*^||^ follows X2(T--i+i)"*iistribution, then, by Theo- 
rem ll ||V(j_i)h*^*)|p follows the Gumbel distribution with normalizing con- 
stants a{K^r-i+i},b{K,r-i+i} givcu in (|4|,([5]) respectively. Thus, the expected 
maximal norm has the expected value of the Gumbel distribution. Thus, 
Lemma |3] follows. D 

Lemma 4. The expected system capacity of Algorithm SIC-Channel- Access 
with ZF-SIC decoding satisfies the following lower bound. 



a. >5^1og(l + 






"^IokKJ 



where ul^^ is a threshold such that \ogK strongest users exceeds it on aver- 
age, in iteration i. 

Proof. Let us consider the probability that no user exceeds the threshold 
■"logx °^ iteration i. This can be expressed as 

pr(iiv,_,)h«f<.;:U) = (i+^)" 

-^ l/K 
Hence, 

r r 

Y,E [log (l + P||V(,_i)h«f )] > J]log(l + n« ^). 
Thus, Lemma |4] follows. D 
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Figure 4: Simulated probability density function of the system capacity, 
where the receiver equipped with r = 4 receiving antennas and there are 
K = 300 users equipped with a single transmitting antenna. The left (pink) 
lobe depicts the capacity when a group of r = 4 users are scheduled, and 
the receiver demodulate the data streams by null forcing. The right (blue) 
lobe depicts the capacity when a group of r = 4 users are scheduled, and the 
receiver demodulate the data streams by SIC. 

Remark. Note that the SIC scheme achieves the scaling laws of the former 
linear decorrelation scheme, yet, it also achieves higher power gain, i.e., there 
is a boot in performance since the effective SNR is increasing in each iteration. 

In Figure |4] we compare the capacity distribution, when the receiver using 
ZF and ZF-SIC to decode the data streams. 

In Figure [5] we depict the gain of multi-user scheduling with and without 
SIC, and compare it to scheduling schemes which schedule a single user at a 
time. 



5 Proofs 

The methods discussed in the previous sections are based on a common 
baseline procedure. First, a threshold is estimated. This threshold is set 
according to the fraction of users which are required to exceed it on average. 
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Figure 5: Simulated probability density function of the capacity with r 
= 4 and K = 300. Green (left): a single arbitrary user is scheduled (ap- 
proximately Gaussian). Yellow (second left): the strongest user is scheduled 
(approximately Gumbel). Pink (second right): a group of 4 strongest users 
are scheduled, with ZF receiver. Blue (right): a group of 4 users are sched- 
uled, with the suggested multi-threshold algorithm and ZF-SIC receiver. 
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In this paper, the threshold is set on the channel vector norm, either directly 
or after projecting it on the null-space of the already chosen users. Then, 
given that the norm threshold was exceeded, we wish to get a handle on 
the capacity distribution. Finally, we wish to express the rate at which users 
exceed the threshold, in order to understand how likely is that a given number 
of users exceed a certain threshold. In this section, we discuss the following 
problems directly for the problem at hand. 

5.1 Threshold Estimation 

Let Uk be a threshold such that only the k users with the strongest channel 
norm ||hj|p will exceed it on average. Since ||hj|p follows a chi-squared 
distribution X2r(^) with 2r degrees of freedom, Uk can be easily estimated 
using the Inverse-Gamma function. 

Claim 2. The threshold Uk, such that k out of K users will exceed it on 
average is 

Uk = 2Q-\r,k/K) 

Note, however, that Claim |2] does not give any intuition on the actual 
threshold value, or, more importantly, how it scales with K for fixed k. For 
this, it is possible to approximate Uk using EVT, similar to [19] . 



Claim 3. Set c = mk, m G N+. The threshold Uk can he approximated as 
follows. 

Uk = hK — Uk log < —log 11 J > + o iai 

Proof. We first estimate a threshold Up. Let c = mk, m G N''', be a constant 
number of maximal users, i.e., each has a maximal norm among a randomly 
selected group of ^ users. We set Up such that only a fraction p of these c 
maximal users will exceed. 

For large enough K, each of these users experiences a norm llhjp which 
is distributed according to the extreme value distribution discussed earlier 
for the X(2r) distribution. Thus, for these users, we estimates the threshold 
by a simple quantile function, 

1 - Go{up) = p. 
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For such Up, we have 




we obtain 












Up 


= bK- 

c 


- ax log{- 

c 


-log(l 


now set Uk 


= Uk. 









G[up) = exp <— e ^^ " "^ > = 1 — p 



P)}- 

n 

Note that no matter which normahzing constants a^ and 6„ are used, 
either according to ^ and (tsl), or the ones derived in this work for different 
values of r (to allow faster convergence to the EVD), i.e., Q and (tsl), in both 
cases we have, ii k = 0(1), Uk = 0{logK). This is crucial for the rloglog-fr 
scaling law. 

5.2 Threshold Arrival Rate and the Capacity Tail Dis- 
tribution 

Once a threshold is set, it is important to evaluate both the distribution of 
the number of users which exceed it as well as the conditional distribution 
of the norm given that the threshold was exceeded. Herein, we utilize the 
Point Process Approximation |32] and its specific usage for threshold arrival 
rates in the single user case [19] in order to derive these distributions for the 
problem at hand. 

Assume that xi,...,x„ is a sequence of i.i.d. random variables with a 
distribution function F{x), such that F{x) is in the domain of attraction 
of some GEV distribution G, with normalizing constants a„ and 6„. We 
construct a sequence of points Pi, P2, ••• on [0, 1] x M by 

Pn= { { -,— ] ,i = 1,2, ...,n 

n a„. 



L, ^, ..., .1, / , 



and examine the limit process, as n — )■ 00. Consider P„ on the set [0, 1] x 

{bi + e, 00), where e > 0. By [32j, P„ — > Pasn — ^ cxd, where P is a non- 

_i_i 
homogeneous Poisson process with intensity density A(t,x) = (1 + ^x)_,_* 

X is the sample value, and t is the index of occurrence. In fact, in the i.i.d. 

case, the process intensity density \(t,x) is independent of t. 
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Let A.{B) be the expected number of points in the set B. A{B) can be 
obtained by integrating the intensity of the Poisson process over B, That is 
A{B) = J^ ^ \(h)dh. As we are interested in sets of the form B^ = [0, 1] x 

(f , oo), where v > bi (threshold exceedance), we have A(i?^) = (1 + ^f )+ , 
where a+ denotes max{0,a} (e.g., [19]). That is, the number of users whose 
channel norms exceed a threshold can be modeled by a Poisson process, with 
parameter A {B^). Note that for ,^ — )■ 0, as in the case of the x^ distribution, 
we have A (B^) — )• e~^, hence setting a threshold logi^ — log A; results in a 
per-user arrival rate of -^, and, as a result, a total arrival rate of k. 

To compute the conditional distribution, we proceed similar to [19]. For 
any fixed v > bi, let Un{v) = anV + bn, and let x > 0. Then 

Pr (xj > a^x + u„(f)|xi > u„(i;)) 

^Pr(P(t) > x + v\P{t) >v) 

A(5„) 

1+e 



1 + ^v 



where P„(t) and P{t) are the corresponding excess value ^'~''" at index t, and 
the corresponding excess value at time t in the limit process, respectively. 
Namely, the result is a generalized Pareto distribution, GPD{an<Jv,^). 
Note that for the x^ distribution, ^ — )■ 0, and we have 

Pr(x — Univ) < a|x — M„(f ) > 0) = 1 — e~^ 

for all a > 0. That is, the tail distribution can be approximated using an 
exponential distribution with rate A = l/a^. As a result, taking the expected 
value, we obtain the following corollary. 

Corollary 1. The expected capacity seen by each user among j users who 
exceeds the threshold u is log{l + P[{u + aK){r — j + 1)]}. 

5.3 Throughput Analysis 

As mentioned, we say that a slot is utilized only if a / < r users transmit 
in that slot, otherwise, the whole slot is lost. In particular, a threshold 
exceedance of more than r users coincides with the event of more than r 
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points that retains in the hmit point process. Similarly, when no user exceeds 
the threshold, it reflects in the event of empty sets in the point process. 

Claim 4. For a threshold Uk we have: Pr ( utilized slot ) = X][=i fr^^^- 

Proof. The probability that at most r out of K users will exceed Uk, follows 



Pr( collision ) = 1 - ^ ( / J (^ k J ^k 

K{K - 1)...{K -l + l)k^{l-k) 



1=0 



E 



K^ i\ ii-ky 

1=0 ^ ' 



K-^oo 



^ i-E^^ 



k^ 
1\ 



1=0 

The last step is actually the Poisson approximation to the Binomial distri- 
bution with parameters B{K, ^). Similarly, under the same settings, the 
probability of an idle slot is 

Pr { idle slot }=fl-^y ""-^ e-K 

Since Pr ( utilized slot ) = 1 — Pr ( idle slot |J collision ), Claim 111 follows. 

n 

Remark. If more than r users exceed, then one (redundant) vector is a linear 
combination of at most r independent vectors. Hence, it's interference will 
not be canceled in the nulling process. In this case, we assume that all the 
streams will be corrupted and the slot will be lost. 

Remark. Note that the strongest users channel correspond to the capacity of 
linear decorrelator in high SNR regime, in which maximum degrees of free- 
dom is achieved. Hence, our suggested user selection achieves the capacity. 

Remark. In the SIC the users are naturally ordered from the strongest to the 
/-strongest user. So they can, maybe, do some distributed water-fllling, such 
that each user will invest power amount proportional to his order. 

Remark. Note that Hassibi had scaling laws of loglogK when a single user 
is scheduled and the number of users is large. In the linear decorrelation 
scheme we the same scaling laws for each user. However, since we schedule a 
group of users we'll have a factor of / comparing to the single user scheduling 
case. 
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